Analysing Effect of Database Grouping on Multi-Database Mining
نویسندگان
چکیده
In many applications we need to synthesize global patterns in multiple large databases, where the applications are independent of the characteristics of local patterns. Pipelined feedback technique (PFT) seems to be the most effective technique under the approach of local pattern analysis (LPA). The goal of this paper is to analyse the effect of database grouping on multi-database mining. For this purpose we design a database grouping algorithm. We introduce an approach of non-local pattern analysis (NLPA) by combining database grouping algorithm and pipelined feedback technique for multi-database mining. We propose to judge the effectiveness of non-local pattern analysis for multi-database mining. We conduct experiments on both real and synthetic databases. Experimental results show that the approach to non-local pattern analysis does not always improve the accuracy of mining global patterns in multiple databases. Index Terms — Local pattern analysis, Multi-database mining, Non-local pattern analysis, Pipelined feedback technique, Synthesis of patterns
منابع مشابه
A Survey on Document Clustering For Identifying Criminal
Crimes are a social nuisance and cost our society dearly in several ways. Crime investigation has very significant role of police system in any country. Developing a good crime analysis tool to identify crime patterns quickly and efficiently for future crime pattern detection is required. This paper presents combine approach of clustering, outlier detection and providing the rule engine to iden...
متن کاملGeometric clustering models for multimedia databases
Recently, in the elds of information retrieval, Data Mining, or Knowledge Discovery in Databases (KDD), is intensively studied to extract implicit useful information from large amount of data. One of the important objectives of KDD is to obtain generalizations by grouping similar objects via clustering. In the case of multi-media databases such as full text database and image database, geometri...
متن کاملData sanitization in association rule mining based on impact factor
Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...
متن کاملUsing Data Mining and Three Decision Tree Algorithms to Optimize the Repair and Maintenance Process
The purpose of this research is to predict the failure of devices using a data mining tool. For this purpose, at the outset, an appropriate database consists of 392 records of ongoing failures in a pharmaceutical company in 1394, in the next step, by analyzing 9 characteristics and type of failure as a database class, analyzes have been used. In this regard, three decision tree algorithms have ...
متن کاملSocial Network Trend Analysis Using Frequent Pattern Mining and Self Organizing Maps
A technique for identifying, grouping and analysing trends in social networks is described. The trends of interest are defined in terms of sequences of support values for specific patterns that appear across a given social network. The trends are grouped using a SOM technique so that similar trends are clustered together. A cluster analysis technique is then applied to identify “interesting” tr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Intelligent Informatics Bulletin
دوره 12 شماره
صفحات -
تاریخ انتشار 2011